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ABSTRACT 

We use mock Sunyaev-Zel'dovich effect (SZE) maps to investigate how well the Planck mission might 
find and characterize clusters of galaxies. We discuss different combinations of frequency maps and 
different methods for identifying cluster candidates. For the simplest methods, the catalogues are not 
complete even for relatively high mass thresholds, but the full sky nature of the mission ensures a large 
sample of massive, high-z clusters which will be ideal for many studies. We make a preliminary attempt 
to identify the X-ray, optical and weak lensing properties of the Planck sample. 

Subject headings: Galaxies-clusters, cosmology-theory 



1. INTRODUCTION 

The Planck mission^ will provide all-sky maps of su- 
perb resolution at 9 frequencies, ranging from 30 GHz to 
850 GHz, with unprecedented signal-to-noise. One of the 
science goals that this enables is the construction of a cat- 
alogue of galaxy clusters detected through the Sunyaev- 
Zel'dovich effect (SZE: Sunyae v fc Ze l'dovich fWTS.TMJI; 
for recent reviews see Rephaeli 1 1 9 9 51 and Birkinshaw 1999 ) 
and a study of the residual SZ emission once these clus- 
ters are removed. In this paper we investigate some of the 
ways a survey with the characteristics of the Planck sur- 
vey will advance our understanding of cluster physics and 
cosmology, building on the earlier work of many authors 
(Aghanim et al. 1997, Kay, Liddle & Thomas 2001: Vielva 
et al.EOnil Herranz et al. .2002. Diego et al. 2003: Hobson 
& McLachlan 1^003). We discuss the sample Planck will 
select, what observations it can give of previously known 
clusters, and some of the properties of the clusters which 
will be useful for foUowup. 

Exploring the expected Planck sample is a complex 
problem, involving several different but interlocking facets. 
Our aim here is to focus on one, while treating the oth- 
ers in as simple a manner as is consistent with what we 
know. This investigation is thus by nature preliminary, 
but we believe that the simulations upon which it is based 
provide an advance over what has been used in the past 
and illuminate several issues future Planck work will need 
to address. We use mock SZ maps drawn from a large 
volume, high resolution N-body simulation. These maps 
capture much of the physics behind the SZ effect, with the 
sources (groups and clusters) situated correctly in their 
cosmological context. Due to the current uncertainty in 
the amplitude of both the SZ signal and the relevant astro- 
physical foregrounds, a detailed modeling of the 'noise' is 
not possible. However we shall investigate signal-to-noise 
levels which should span the allowed range. Our investi- 
gations lead us to conclude that constructing a catalogue 
of clusters from Planck will be a difficult undertaking, but 
one which is worth a great deal of effort. Which are the 
optimal methods for foreground subtraction, cluster find- 
ing and modeling the selection function are all open ques- 
tions currently, and this work will only begin to explore 

-"^http: / /astro. estec.esa.nl/Planck/ 



some of the relevant issues. Also uncertain is the most 
efficient way of collecting relevant information during the 
pre-launch phase of Planck. 

The outline of the paper is as follows. In we de- 
scribe our methodology for isolating the SZ signal from 
the other astrophysical components (including the CMB) 
in our simulated observations, and the simulations we are 
using to model the SZ effect itself. We describe two meth- 
ods for finding cluster candidates in our simulated maps 
in fJSland how well they do in ^ We discuss using the 
lower frequency channels of Planck in order to reduce IR 
point source contamination in f|51 In [jS] we describe the 
properties of our cluster sample from the point of view of 
X-ray, optical and weak lensing foUowup. The dependence 
of our results on our cosmological and foreground model- 
ing is discussed in SQand our conclusions are presented in 

m 

2. SIMULATED OBSERVATIONS 

The thermal SZE signal is likely to be dominated by 
virialized objects whose typical angular size is around 1'. 
Thus if we can achieve a suitably low noise level, including 
foreground subtraction, the Planck channels which will be 
best suited to studying the SZE signal from galaxy clus- 
ters will be those at higher frequency, where the angu- 
lar resolution is best. If it turns out that high frequency 
foregrounds are particularly troublesome, and cannot be 
removed to a suitable level, we may need to consider mov- 
ing to lower frequencies, and resolutions, where the 'noise' 
is lower. We shall consider both approaches here, begin- 
ning with the high frequency option. We shall return to 
the lower frequency option in 

The first step in studying the SZE is to use the multi- 
frequency capability of Planck to isolate the SZ signal from 
that of the foregrounds and the background (see Herranz 
et al. 120021 for a summary of recent work and Gomez et 
al. ^2002 for a recent example using real data). We expect 
the foregrounds at high and low frequency to be differ- 
ent, with only the CMB in common across the channels. 
We shall phrase the effects of foreground removal in terms 
of an increase in the effective "noise" of our SZE maps. 
This is appropriate in the sense that finding clusters in 
Planck maps is a very local procedure. The clusters are 
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FlG. 1.— One of our simulated maps: AT(353) - AT(217). (Top) 
the input SZ signal, converted to AT at 353 GHz but without noise 
or smoothing. (Bottom) the same map, with 40 //K of noise and 
5' smoothing. In both panels the greyscale is linear in AT ranging 
from -100 //K (white) to +100 /xK (black). 



(at most) a few arcminutcs on a side and the map cov- 
ers 40,000 square degrees so mostly what matters is the 
S/N in the map at the position of the cluster. By quan- 
tifying the required S/N in this way we are attempting 
to decouple cluster finding from the details of foreground 
subtraction. Isolating the different parts of the problem 
in this way is appropriate during this exploratory stage 
and our current knowledge of the relevant foregrounds is 
sufficiently imprecise that our simplistic treatment offers 
several advantages. 

Let us first consider the SZE 'signal', then the CMB 
'background' and then the other astrophysical 'foregrounds'. 
While in general we expect the foregrounds to have un- 
known, spatially varying spectral characteristics, for SZE 
studies we are aided by the fact that the spectral shape of 
our primary CMB 'background' and SZE 'signal' are very 
well known. In thermodynamic units the thermal SZE 
scales, for non-relativistic Tg, as 



AT / e'^-hl ; 

~ —2y for X ^ 1 , 



(1) 
(2) 



where x = hv/kTcuB — !^/56.85GHz is the dimension- 
less frequency, and the second expression is valid in the 
Ray leigh- Jeans limit. The quantity y is known as the 
Comptonization parameter and is given by 
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where the integral is performed along the photon path. 

Since Tp ^ ?cmb the integrand is proportional to the 
integrated electron pressure along the line of sight. For 
frequencies below approximately 217 GHz the SZE is a 
decrement in the CMB while above this frequency it is 
an increment. 

Obviously the CMB has a constant temperature in ther- 
modynamic units. In removing this background we could 
in principle use many of the Planck channels, however the 
differing beam sizes represents a potential problem if we 
wish to work at high angular resolution. All of the chan- 
nels above 217 GHz have a 5' FWHM beam, the lower 
frequency channels have beams ranging from 33' (30GHz) 
to 7' (143GHz). The fluctuations measured at any point 
on the sky by a beam of width 9 differs from that measured 
with 5' resolution by a 'noise' 



47r ^ — ' 



(4) 



where Ce. is the angular power spectrum of the CMB at 
multipole moment i and is the (Legendre) transform 
of the beam. For a COBE normalized ACDM model 
this typically ranges from 5 /xK at 7' through 14 /iK at 
10' to 50 /iK at 33' (we use thermodynamic temperatures 
throughout). The difference in the CMB signals measured 
by the 217 GHz channel and frequency channels below 
100 GHz is thus comparable to the noise (13 /iK) in the 
217 GHz channel itself. Thus only the 143 GHz channel 
provides any strong additional estimate of the CMB tem- 
perature at 5' scales, the lower frequency channels serve 
primarily to guard against low frequency foregrounds. 
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At 5' Planck has 4 channels: 217 GHz, 353 GHz, 545 GHz 
and 857 GHz. For simpHcity we shall neglect the finite 
width of the Planck bands. If the frequency dependence of 
the signals is known and the bands are well characterized 
(Church, Knox & White 2003) the bandwidth presents no 
additional problems in principle, and we lose nothing by 
neglecting it here. 

Finally we consider the astrophysical foregrounds, which 
we assume will be dominated by extra-galactic point sources 
at the relevant frequencies. We therefore implicitly assume 
that any dust emission has been removed using a combina- 
tion of the high frequency channels and/or spatial filtering, 
or we are working in a (rare) low dust region of the sky. 
If the frequency dependence of the dust is known, or we 
need only fit a single extra parameter (e.g. a power-law 
index for hv <C fcTdust) the high frequency channels form 
an excellent dust monitor even on a pixel- by-pixel basis. 
In fact, detailed simulations suggest that dust can be re- 
moved quite well from the Planck maps (Vielva et al. 120011 
Table 2 or Stolyarov et al. U002. Table 2). 

The extra-galactic source counts at 353 GHz can be 
taken from observations with the Submillimeter Common- 
User Bolometer Array (SCUBA; HoUand et al. I1999|l on 
the James Clerk Maxwell Telescope. SCUBA has been 
used to make several deep observations (Barger et al.!1998! 
Bales et al. 1998 Holland et al. 1998- Hughes et al. iil998t 
Small et al. ll997|l from which we can extract source counts. 
Using the model in Scott & White H1999|l we predict that 
residual point sources should contribute around 70 fiK per 
5' beam at 353 GHz if we subtract all sources brighter than 
4-5(7 at 545 and 857 GHz. The source counts, and hence 
this normalization, are uncertain at the 50% level. An ad- 
ditional uncertainty comes from the fact that the sources 
relevant to Planck are the bright end of the SCUBA pop- 
ulation. Since SCUBA has surveyed only a small area of 
sky, our extrapolations are uncertain, especially if the na- 
ture of the brightest sources differs from the majority of 
SCUBA sources. 

We implicitly assume that the sources are uncorrelated 
with the clusters that Planck will detect. At low fre- 
quencies we know radio point sources can be correlated 
with clusters, but we do not know much about the sub- 
population which could be detected above 100 GHz. Ex- 
trapolations from the recent WMAP data (Bennett et 
al. i2003i) suggest radio sources will be a negligible con- 
taminant above 100 GHz. If the IR sources arise from high 
redshift, as current theories suggest, they are likely to be 
uncorrelated with massive clusters. However this assump- 
tion is an idealization which needs to be revisited in future 
work. 

Our procedure is as follows: of these 4 high resolution 
bands we will use the two higher frequency maps to veto 
or remove the foregrounds and brightest point sources (if 
their spectrum is known). We use the first two bands to 
isolate the CMB signal from the SZE signal. For primary 
CMB anisotropics and SZ signal, the signal- independent, 
unbiased, minimum variance estimate of the SZ compo- 
nent is simply the difference^ of the two channels, as we 

^If the channels have finite width a recalibration is also required, 
but this is a technical detail which need not concern us here. 



would have naively guessed. We will therefore use the sim- 
plest possible filtering scheme: differencing the 353 GHz 
and 217 GHz channels. The difference map is dominated 
by the noise in the 353 GHz channel, which is roughly 
40 /iK per beam. We assume that foreground contami- 
nation in the 217 GHz channel is dominated by that in the 
353 GHz channel and that the foregrounds become worse 
as we go to higher frequencies. 

Specifically our first results will be drawn from maps 
which consist of SZE, converted to AT at 353 GHz and 
smoothed to an angular resolution of 5', plus noise. Con- 
sistent with our philosophy of isolating different physical 
effects, we model the pixel noise as Gaussian, white noise 
and choose three noise"^ levels: 20 /iK, 40 /iK and 80 /iK. 
The first corresponds to the noise near the poles where the 
integration time is the largest and we assume the sky is ab- 
normally clean of point sources or perhaps if we addition- 
ally make use of the 143 GHz channel. It is likely a lower 
limit to the noise once additional complicating effects are 
included in the analysis. The second is our fiducial noise 
from the difference of the 353 GHz and 217 GHz channels. 
The third includes the contribution from extra-galactic 
point sources, using the model above, assuming that the 
sources which are not detected at 545 or 857 GHz can be 
modeled simply as additional Gaussian noise at 353 GHz. 
The assumption of Gaussianity implicitly assumes a 'cen- 
tral limit' argument, the treatment of the additional signal 
as noise assumes the foregrounds are uncorrelated with the 
CMB or SZE signal and that we don't know the true sky 
monopole (and so subtract a mean from all of the maps). 
Our treatment should therefore be regarded as a simple 
1-parameter "effective" noise level which will set a target 
for foreground removal methods to meet. We do not con- 
sider higher levels of noise/foregrounds as the cosmological 
signal becomes progressively harder to extract. 

We construct maps of the SZE effect at various frequen- 
cies using the method outlined in Schulz & White H2003|l . 
The maps are created from a large volume, high resolution 
N-body simulation containing a 'fair sample' of the uni- 
verse, in a manner tuned to reproduce the results of the 
hydrodynamic simulations reported in White, Hernquist 
& Springel H2002|l . The normalization has been adjusted 
to pass through the lower envelope of the CBI deep field 
results reported in Mason et al. (20Q3J and through the 
BIMA point (Dawson et al. 12001]) at higher £. This is close 
to the power seen in more recent CBI data (A. Readhead, 
private communication) and a factor of approximately 4 
(in power) larger than would be predicted by the simula- 
tions of the 'concordance' cosmology reported in White, 
Hernquist & Springel ( 2002]) - we shall discuss lower nor- 
malizations in SJTI The N-body maps are not as realistic as 
those produced using full hydrodynamic simulations, but 
they come from a larger volume to provide a better sample 
of the rare, high mass clusters. By using such simulations 
we ensure that our SZE sources are situated in their proper 
cosmological context, including their relation to the over- 
all growth of large-scale structure. We shall return to this 
issue in ^ 

We generate 10 maps, each 10° x 10° with 1024^ pixels, 

''As we discuss in J7|our uncertainties in both signal and noise 
argue against a more refined set of values at this point. 
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and for each map we produce from the underlying simula- 
tion a catalogue of the 3D positions and properties of the 
clusters in the field. While the maps, being drawn from 
the same underlying simulation, are not fully independent, 
they sample different linc-of-sight projections and cluster 
orientations. This is important in that our model contains 
aspherical clusters with a range of sizes, sub-structures and 
properties in a range of environments, providing a more 
realistic model of what Planck should see than randomly 
placed, sphericized, analytic models. 

To the maps we add Gaussian noise scaled to the pixel 
scale from the levels quoted above. Our noise is uniform 
and white, which will not be the case for the Planck mis- 
sion. However the noise variations are expected to be on 
larger scales than are relevant for finding point-sources, so 
our approximation shouldn't affect our results. An exam- 
ple of one of our maps, additionally smoothed by the beam 
to reduce the noise level per <C 5' pixel, is shown in Fig. ^ 

3. FINDING CLUSTERS 

We shall investigate two different methods for finding 
clusters in our simulated maps. The first simply fiags lo- 
cal maxima in the (smoothed) map. The second uses a 
matched filtering algorithm similar in spirit to the (more 
sophisticated) algorithm discussed in Hobson & McLach- 
lan H2003)) and in implementation to the matched filter 
described in White & Kochanek ( 2002.1 . 

The first, and simplest, method starts with a list of lo- 
cal maxima. Since our pixel scale is <C 5' we first smooth 
the maps to reduce the noise level per pixel. We choose 
an additional smoothing of the same size as the beam. 
Fortunately, our results are not very dependent on this 
choice. If we increase the smoothing slightly we find more 
nearby clusters at the expense of distant clusters. The 
optimal filtering appears to be 20 — 40% larger than the 
beam size, though it does depend on the noise level cho- 
sen. The smoothing filter could also be combined with a 
high pass filter (for example using the mexican hat wavelet; 
e.g. Cayon et al. 20001 Herranz et al. l2(J(J2|l without chang- 
ing our results. While an optimization of the filter would 
be an interesting exercise, the differences are slight and we 
shall neglect this extra degree of freedom here. Once we 
have a list of candidate clusters, we keep all whose peak 
value in the map lies above a given threshold, discussed 
below. We also keep track of which peaks match halos in 
our 3D catalogues. When matching peaks and halos we 
require that the center of the halo lie within 2' (4 pixels) 
of the peak maximum. 

The second method is a combination of a matched filter 
algorithm and a 'CLEAN'-like algorithm fHogbom ITFTH 
Clark [T^Snil . The key assumption is that clusters will look 
like point sources at the Planck resolution. Assuming that 
the clusters of interest are unresolved, we model clusters 
as Gaussian with the same width as the beam. We do not 
scan over a range of filter sizes as in Herranz et al. (2002). 
Given a map we evaluate the best fitting amplitude of a 
Gaussian for each pixel. The fit is done over 3' around the 
central pixel to avoid overweighting the outskirts of the 
cluster"*. Our results are not very sensitive to this choice. 

*Note that since we are fitting to a profile which is already 



The most significant detection is added to our peak list, 
and then the best fit Gaussian is subtracted from the map. 
This process is iterated until no further significant detec- 
tions (> 3ct) are found. One advantage of this method is 
that it can easily deal with edges and non-uniform noise 
while retaining the full resolution of the map (i.e. we avoid 
any additional smoothing). A disadvantage is that we 
implicitly assume the clusters are unresolved (i.e. beam 
shaped). As above, when matching peaks and halos we 
require that the center of the halo lie within 2' (4 pixels) 
of the peak center. 

4. RESULTS 

We present our results in terms of the completeness and 
efficiency (or reliability) of the method in finding clusters 
above a mass threshold. Completeness is the ratio of the 
number of clusters we found using the mock SZE observa- 
tion to the total number of massive clusters in the field of 
view. Out of the total number of cluster candidates that 
we identify in our SZE maps, only some of them will actu- 
ally be clusters with a mass above the threshold of interest. 
We allow more than one cluster to match a peak, under the 
assumption that any confirmation procedure would likely 
detect both clusters and thus we would have 'found' both. 
On the other hand, we make no correction for edge effects 
so our completeness will typically be underestimated by 
a few percent. Efficiency (or reliability) will measure the 
ratio of clusters found to the total number of candidates, 
and is a measure of the amount of contamination suffered 
when using the SZE technique. 

smoothed by the beam, we are however including emission from much 
larger than this radius. 
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Fig. 2. — The clusters found and missed by matching peaks in 
maps with 20 fiK of noise. We have kept all peaks above 50% reli- 
ability for halos more massive than 3 X lO'^* h~^MQ. Crosses indi- 
cate clusters in the field which did not match any peak above the 
threshold, open circles the clusters which lay within 2' of a selected 
peak. Crosses inside circles indicate clusters which were found in one 
orientation but missed in another (corresponding to different lines- 
of-sight with differing large-scale structure in projection etc). The 
solid lines show (left to right) lines of constant SZE flux (200mjy, 
400m Jy and 600mjy) in the idealized model described in the text. 
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Often completeness and efficiency are phrased not in 
terms of finding clusters above a fixed mass but rather in 
terms of finding clusters above a certain fiux threshold. 
Since there is scatter in the relation between mass and 
flux the conversion between these definitions is non-trivial. 
Even a sample which is 100% complete and efficient in fiux 
selection will be incomplete above a given mass threshold if 
the scatter is non-zero. In terms of the physical properties 
of the clusters that Planck finds the selection in terms of 
mass is most useful, and that is why we focus on it here. It 
is possible, indeed likely, that for characterizing the sample 
and selection effects at a later stage a flux limit may be 
more appropriate. 

Despite the added complexity we find that our matched 
filter method performs worse than the simple peak find- 
ing on smoothed maps, missing more of the high mass 
clusters. This suggests that at 5' our assumption that 
all of the SZE signal is unresolved, or that the peaks are 
beam shaped, is not true. To improve upon the method 
would require us to perform an expensive search through 
non-circular, arbitrary sized shapes. For this reason we 
shall concentrate here primarily on the results from the 
peak finding algorithm. We find that for all levels of noise 
and smoothing in the maps our efficiency /reliability drops 
rapidly as we lower the candidate threshold. We shall ad- 
just our threshold for each map so that our reliability is 
50% (and we have searched at least 10 peaks). The precise 
value this takes will depend on the noise, smoothing and 
the mass threshold for halos we are considering. For a cut 
at 5 X 10"'^'* H^^Mq these thresholds are approximately 40, 
80 and 160 /iK, with a slight variation from map to map. 
It is unlikely that a lower threshold would be chosen, so 
this is the most optimistic assumption in terms of finding 
clusters. 

We show in Fig. |5] the distribution of halos which are 
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Fig. 3. — The redshift distribution of clusters above 5 X 
10^* /i~^Mq. (Top) The soUd histogram shows all of the clusters in 
the 10 fields, which agrees quite well with the predictions of Press- 
Schechter theory (not shown) once masses are corrected to M2oo- 
The dashed and dotted lines show those clusters which lay within 
2' of a peak above the threshold for the indicated level of noise in 
the map. (Bottom) The fraction of clusters found. 



found in our lowest noise fields. We see immediately that 
Planck finds all of the highest mass clusters in the field 
out to z ~ 0.8, but only about half of the clusters at lower 
masses even with this optimistic noise level. Based on this 
figure we shall set our mass threshold for candidates at 
5 X 10^^ h~^MQ. This is somewhat larger than thresholds 
which have been quoted in earlier papers, but based on 
Fig.|21we expect a small fraction of the clusters less massive 
than this will be detected by Planck using this peak finder. 
We also show on Fig. |21 some lines of constant SZE 'flux', 
assuming infinite resolution. These lines are computed 
following Kay, Liddle & Thomas l(M7T|l Eq. (14), which 
gives the integral of the SZE over the virialized region of 
the (isothermal) cluster as 

^.(353 GHz) ^ 75mJy (^^^^^^iLj (5) 

K soo/^-W ) + 

where Da is the angular diameter distance. Beware that 
these lines do not include the effects of the finite resolution 
of Planck. They are included to show how closely the 
selection corresponds to an intrinsic fiux cut. 

There are some clusters which appear in Fig. [3 as both 
'found' and 'missed'. For these clusters the detection de- 
pends upon the orientation of the line-of-sight, and we 
have simulated several orientations. Due to a combina- 
tion of line-of-sight projection and asphericity effects the 
cluster is easier to find in some orientations than others. 
This kind of effect is absent in analytic models based on 
spherical clusters placed randomly in background fields. 

Finally we show the redshift projection in Fig.|31 Above 
5 X 10^* /i~^Mq the completeness is relatively good for 'lo- 
cal' clusters, but drops below 50% beyond z ~ 0.5. Since 
the volume is a rapidly rising function of redshift many of 
the Planck clusters will lie beyond z — 0.5, they will not 
however be a very complete sample. 

Our investigations suggest that the noise level is quite 
important in the Planck cluster yield. We find most of the 
massive clusters in the deepest, cleanest parts of the sky 
(our 20 /iK example). This degrades somewhat by 40 /iK. 
By 80 /iK we are finding very few of even the most mas- 
sive clusters. Our estimate of the unresolved point source 
contribution, from the SCUBA counts, suggests that the 
latter situation is the most likely over much of the sky. 

5. WORKING AT LOWER FREQUENCY 

If our estimates of the signal strength and foreground 
contamination are correct then (at least for simple peak 
finding) Planck will be 'noise' limited over most of the sky, 
with the noise dominated by unresolved IR point sources 
or residual dust emission. This suggests bringing in in- 
formation from lower frequencies, where these foreground 
contributions are less (see also the discussion in Herranz 
et al. [2I)T)2|ll . The lower frequency foregrounds would then 
have to be controlled by the channels at 100 GHz and be- 
low (which emphasizes the importance of the wide fre- 
quency coverage of Planck) or perhaps reduced by high- 
pass filtering of the maps to remove diffuse emission. 
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For both the 143 and 217 GHz channels the unresolved 
IR point source contributions are comparable to the in- 
strument noise in those channels while the lower frequency 
foregrounds are expected to be below the noise. Thus if 
we difference the 143 GHz and 217 GHz channels to iso- 
late CMB from SZ, and we assume that the lower fre- 
quency channels have been used to remove the residual 
low frequency emission, we will be dominated by the noise 
(and foregrounds) in the 217 GHz channel: we will use 
15 /iK per 7' pixel. Our SZ signal will now come from 
143 GHz which gives thermodynamic temperature fluctu- 
ations a factor ~ 2 lower at the same y value. Our proce- 
dure is exactly as outlined above, except that we increase 
the matching radius to 3' (6 pixels) since our smoothing 
length has correspondingly increased. This relaxed match- 
ing criterion, which allows 40% more sky area, should be 
borne in mind when comparing the different frequencies. 

At 7' the correspondence between peaks in the temper- 
ature map and massive clusters is becoming loose. The 
large smoothing means that the cluster signal is signif- 
icantly diluted and clusters in overdense regions have a 
higher chance of being included in the sample than rela- 
tively isolated clusters. Our matched filter now does as 
well as simply smoothing the maps and looking for local 
maxima, suggesting that the peaks are now well approx- 
imated by the shape of the beam, but it still does not 
perform significantly better so we shall stick to the sim- 
pler method for comparison with 21 Fig. 0] illustrates the 
situation in the same format as Fig. |5| The completeness 
is intermediate between the 5' cases with 40 /zK and 80 /^K 
noise across the entire range of redshifts and we do not 
show it explicitly. 

This suggests that a Planck sample obtained from the 



lower frequencies is indeed an option should the situa- 
tion at higher frequencies be closer to (or higher than) 
the 80 /iK figure than the 40 in the patch of sky un- 
der consideration. Viewed another way, the overlap of two 
catalogues produced at these two frequencies would pro- 
vide an important cross check. We defer consideration of 
optimal combinations of the channels to a future paper. 

6. FOLLOW UP 

The majority of the Planck cluster sample will be unre- 
solved, so the exploitation of the data for cluster science 
rests on their combination and correlation with indepen- 
dent data sets. For this reason, and more generally, it is of 
some interest to ask what are the optical and X-ray prop- 
erties of the cluster sample we have identified here. These 
properties will determine the optimal follow-up procedures 
and set the scope for any pre-launch work which could aid 
in cluster finding (see also Diego et al. 12002(1 . 

We will use simple models to make a rough estimate of 
the X-ray, optical and lensing properties of Planck clus- 
ters (see Table nj. Firstly, a cluster of W^^H'^Mq has 
a (ID) velocity dispersion of llOOkm/s at z = 0, scaling 
as {E{z)M2ooy^^ where E{z) = H{z)/Ho is the evolution 
parameter or dimensionless Hubble parameter. For the 
X-ray properties we make use of measured scaling rela- 
tions between mass, temperature and luminosity. Given 
the mass and redshift of a cluster we compute the clus- 
ter temperature from (Finoguenov, Reiprich & Bohringer 
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Fig. 4. — The clusters found and missed by matching peaks in 
our low-frequency maps with 15 /iK of noise. We have kept all peaks 
above 50% reliability for halos more massive than 5 X lO" h'^MQ. 
Crosses indicate clusters in the field which did not match any peak 
above the threshold, open circles the clusters which lay within 3' 
of a selected peak. Crosses inside circles indicate clusters which 
were found in one orientation but missed in another (correspond- 
ing to different lines-of-sight with differing large-scale structure in 
projection etc). 



where the redshift evolution has been taken to follow self- 
similar collapse. The conversion from temperature to lu- 
minosity is still somewhat uncertain, depending on as- 
sumptions made about cooling flows and other corrections. 
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Fig. 5. — The expected X-ray flux of clusters detected as peaks 
in the 353 GHz channel with 40 /^K of noise per 5' beam in our 10 
maps of 100 sq. deg. each. 



7 



For example, Allen & Fabian H1998I Model C) quote a con- 
version between temperature and (bolometric) luminosity 
as 



L = 5 X 10 



44 



T 



6keV 



2.33 



h ergs s 



(7) 



For comparison, a slightly different conversion is given by 
Markevitch 119981) 



L = 3 X 10 



44 



6keV/ 



2.64 



h ^ergss ^ 



(8) 



We shall use the Markevitch relation, with its more con- 
servative assumptions, and assume that the L — T relation 
does not evolve with redshift, but the difference between 
these two relations should be taken as an indication of the 
uncertainty. We apply the bolometric and K-corrections 
from Romer et al. (200T) and compute the flux from the 
X-ray luminosity through 



(9) 



where Dl is the luminosity distance to the cluster redshift. 
The corrections work at the few percent level, compared to 
the exact calculations using XSPEC fArnaud ll996|l which 
is more than adequate for our purposes (A.K. Romer, pri- 
vate communication). Throughout we shall quote fluxes 
in the 0.5 — 2.0keV band. Using these scalings the dis- 
tribution of fluxes is as shown in Fig. [3 peaking near 
10~^^ ergss"^ cm~^ with a tail to 10"^'^ and a bright end 
as high as 10^^". Many of these clusters will be detectable 
in reasonable (several ksec) integrations with existing X- 
ray telescopes if they are still operating. Of the exist- 
ing X-ray surveys, those based on the RASS (Triimper 
et al. ITMTl Voges et al. [TMSl e.g. Reflex: Bohringer 
et al. mm MACS : Ebehng, Edge & He nry |2 001. NEP: 
Henry et al. VMU BCS: Ebehng et al. have the 

combination of sensitivity and sky coverage to provide the 
best pre-launch catalogues for Planck. In the fields where 
XMM and Planck overlap there should be strong X-ray 
detections in the XCS (Romer et al. l^niIT|l . There wifl 
additionally be many clusters in the deep optical surveys 
which will be completed by the time Planck launches (see 
below). 

Optical and near-IR emission is still the least expensive 
way of measuring cluster redshifts. Without redshift infor- 
mation, the distance to the cluster is essentially unknown 
and so the physical interpretation of the cluster sample 
depends on our ability to detect clusters in optical light 
and determine their redshifts. 

In order to determine how many clusters we could find 
or study from optical imaging it is useful to have an es- 
timate of the cluster richness. For our purposes a rough 
estimate suffices, so we model the (R-band) luminosity 
function of cluster galaxies after that of Coma, for which 
a Schechter function with = —20.9 and a = —1.2 is 
a reasonable fit (Beijersbergen et al. '2002|). The normal- 
ization should scale roughly with the mass, (j)^ oc M, but 
the overall value is uncertain. We will assume 20 galaxies 
within the virial radius brighter than for a 10^^ h~^MQ 
halo. The number in the core region, where the contrast 
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Fig. 6. — The number of galaxies brighter than R = 25 within the 
virial radius of our Planck cluster sample. The number within the 
core region, where the contrast against the background is higher, 
would be about 10% of Afgai(< r'20o)- Again the clusters are those 
in the 353 GHz channel with 40 /jK of noise per 5' beam and 10 
maps of 100 sq. deg. each. 




Fig. 7. — The weak lensing signal-to-noise, assuming the pa- 
rameters described in the text, of clusters detected as peaks in the 
353 GHz channel with 40 of noise per 5' beam and 10 maps of 
100 sq. deg. each. 
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against the background is the highest, wiU obviously be 
smaller. If the galaxies follow the mass, approximately 
20% of the galaxies lie within the break radius (0.2r2oo) 
and 8% within the core radius (0.1r2oo) of a rich cluster. 
Assuming pure luminosity evolution, scales as lO*^'^^ 
(van Dokkum et al. I1998|l . For simplicity we use the K- 
corrections appropriate to elliptical galaxies for the Sloan 
r' band from Fukugita, Shimasaku & Ichikawa 1)1995(1 . 

We see from Fig. Eland Table that we will find a signif- 
icant number of very massive, high-z clusters which should 
contain many galaxies within the virial radius brighter 
than R — 25 (Scr photometry to i? = 25 takes approxi- 
mately 1 hour on a 4m class telescope). Even taking into 
account that we will have to work at a small fraction of the 
virial radius, the Planck sample will be ideal for studying 
the galaxies in the most massive and distant clusters. Since 
the Planck sample is biased towards the most massive clus- 
ters, which should contain a red-sequence, this also sug- 
gests that deep 2-band photometry would be an efficient 
way to verify cluster candidates and obtain an approxi- 
mate redshift (Gladders & Yee l2()()()|l for those which have 
not already been detected by SDSS (Bartelmann 12001)) 
or other galaxy surveys. Once the candidates are con- 
firmed as clusters, with an approximate redshift, it will be 
possible to target a sub-sample for spectroscopy. Above 
R — 22, which is achievable on a 10m telescope with a 
moderate (30min) integration, the number of galaxies is 
approximately 1/3 of the number shown in Fig.El 

The gravitationally lensing subsample of the Planck 
clusters has been investigated by Bartelmann (2001), who 
showed that many of the clusters Planck will detect should 
also give measurable weak lensing signatures. Following 
Bartelmann we have computed the S'-statistic, giving the 
signal to noise for 30 galaxies per arcmin^ at z — 1.5, 
for the weak lensing signal on 1'. This is optimistic for 
current observations, but may be attainable by future fa- 
cilities. Figure [7| shows that almost the entire Planck 
cluster sample will be excellent targets for weak lensing 
surveys, having extremely high signal to noise. Though 
line-of-sight projection can be a significant contaminant 
for weak lensing studies of the general cluster population 
(Metzle r, Wh ite & Loken OnUTl White, van Waerbeke & 
Mackev l2002p it is less of a problem for the Planck sam- 
ple which consists primarily of the most massive systems. 
Our conclusions are thus in line with, though slightly 
stronger than, those of Bartelmann who suggested that 
Planck clusters would form an excellent sample for weak 
lensing follow-up. It also follows from this that many of 
the Planck clusters will produce measurable lensing fea- 
tures in the CMB itself, although the resolution of Planck 
is not well matched to mapping the majority of these clus- 
ters. 

We show the cluster properties from our simple model in 
Table^for a range of redshifts. Because we are holding the 
mass fixed, the X-ray temperature (and thus luminosity) 
are increasing functions of redshift. The lensing signal 
peaks half way between z = and z = 1.5, around z ~ 0.6. 

7. COSMOLOGY DEPENDENCE 



We have assumed a particular cosmological and fore- 
ground model for our study, and it is important to ask how 
sensitive our results are to this assumption. Our neglect of 
dust emission is clearly optimistic. Our 'foreground noise' 
is matched to the SCUBA counts at 353 GHz, but these 
counts are uncertain at the 50% level, leading to a similar 
uncertainty in our noise level. We have assumed a fairly 
steep frequency dependence in extrapolating to lower fre- 
quencies, and this may not be true of all the sources. Con- 
versely, the brightest IR sources may also be bright in some 
other waveband (e.g. radio), which may provide additional 
leverage in removing them. On the cosmological side the 
ACDM paradigm seems to be fairly secure, and the val- 
ues of the matter density and Hubble constant we have 
chosen are quite standard. However all of the calculations 
presented above had the SZ signal normalized to match 
the CBI and BIMA results. If the CBI signal is not domi- 
nated by thermal SZ then the SZ signal from clusters may 
be lower than we have assumed. This scenario would sug- 
gest using a lower matter power spectrum normalization 
(fJs) than we have used throughout. 

To this end we have run another simulation, identical to 
that described in ^ except that erg — 0.8. As expected 
the lower erg reduces the number of the most massive and 
high redshift clusters. Our redshift distributions are then 
shifted slightly to lower z. The number of massive clusters 
which Planck can see is reduced compared to the num- 
ber of lower mass SZE sources, which slightly increases 
the confusion. Neither of these two effects alters our con- 
clusions, so the main impact is that there are fewer clus- 
ters overall. Simply rescaling the total number of clusters 
doesn't have a large impact on our conclusions, it only 
makes our statistics more susceptible to Poisson fluctua- 
tions from the relatively small simulation volume. Thus 
running the simulation with erg = 1 can be regarded as a 
simple way to increase the statistics on rare objects while 
keeping the box size small and hence the mass and force 
resolution high. 

We have used a cluster mass-temperature normalization 
close to that predicted by the hydrodynamic simulations. 
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Table 1 

Cluster properties, computed with our simple 

SCALING LAWS. ThE MASS IS QUOTED IN 10^^h~^MQ, 
LUMINOSITY IN 10''* ergS , FLUX IN 

10""'^ ergss""' cm~^, temperature in keV, lensing as a 
SNR and number of galaxies within the virial radius 

BRIGHTER THAN R = 25. 
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chosen to match the level of power seen by CBI^. How- 
ever, X-ray observations suggest that clusters at a fixed 
mass may be hotter than most simulations predict (see 
e.g. Table 1 of Muanwong et al. 2002 or Fig. 2 of Huterer 
& White I7U02|I . If this holds across the whole cluster, and 
the temperature estimated by X-ray observations is close 
to a mass weighted temperature, this would imply an in- 
crease in the SZ signal per cluster. An increase, at the 
20-50% level, in the SZ signal per cluster is possible. A 
simple rescaling of the noise, which is anyway uncertain at 
this level, would mimic any such change. 

Thus we expect that our simulations, while not perfect, 
should provide a fair guide to the expected size of the 
SZ signal based on our current knowledge. If anything the 
indications are that our simulations may predict too many 
clusters, especially at high redshift, and that each cluster 
may produce slightly too little SZ signal. At present it is 
not unreasonable to assign a factor of 2 uncertainty to the 
signal-to-noise in our maps. 

8. CONCLUSIONS 

The SZE offers a new and potentially very powerful 
method for finding high redshift clusters of galaxies, and 
the Planck mission will be unique in producing all-sky 
maps at the relevant frequencies with high angular res- 
olution and sensitivity. 

We have presented a preliminary investigation of the 
cluster sample which Planck should provide. We used 
mock SZ maps drawn from a large volume, high resolu- 
tion N-body simulation. These maps capture much of the 
physics behind the SZ effect, with the sources (groups and 
clusters) situated correctly in their cosmological context. 
To these maps we add 'noise' arising from the detectors 
and from incomplete foreground subtraction. Our fore- 
ground modeling has been highly simplistic and idealized, 
both because cluster finding is a local process and because 
we wish to decouple that degree of uncertainty from the 
main focus of this work. Uncertainty in cluster physics 
and our foreground model suggests that our signal-to-noise 
may be uncertain at (up to) the factor of 2 level. For this 
reason we have simulated a range of 'effective' noise ampli- 
tudes at fixed signal. Improving our normalization of the 
sources will require a moderate sample of clusters whose 
integrated SZ signal and X-ray temperature or velocity 
dispersion is known, for comparison with the simulations 
used here. Our high frequency foreground uncertainty will 
be improved by better measurements of the dust emission 
and the bright end of the source luminosity function and 
the frequency dependence of the sources. The sources of 
interest for Planck are the rarer sources brighter than a 
few tens of mJy. 

We have used combinations of only 2 frequency maps, 
plus vetoing regions where the 545 and 857 GHz maps show 
strong dust/sources, to disentangle the SZ signal from the 
CMB and astrophysical foregrounds. This is perhaps the 
simplest method, and the easiest to understand. It re- 

^Most of the increase in power in our simulations compared to the 
hydrodynamic simulations comes from our high assumed erg since at 

fixed M — T normalization Ce <x o-g*'''^^"^ where n is the effective 
spectral index. 



mains to be seen how it compares with more complex al- 
gorithms and more realistic treatments of foregrounds. 

We have found cluster candidates in our simulated dif- 
ference maps either by flagging local maxima or using a 
point-source optimized matched filter algorithm. We find 
the former is at least as successful as the latter, suggesting 
that even at 5' a significant fraction of the detections will 
not be beam shaped and would be missed by algorithms 
optimized to find point sources. For our lowest noise lev- 
els wc show that Planck detects almost all of the nearby, 
massive clusters and a significant fraction of the massive 
clusters out to z ~ 1. The completeness in our 10 fields 
ranges from 40-70% for all clusters above 5 x lO" h'^MQ. 
At intermediate noise levels the range is 10 — 30% while at 
the highest noise level we consider (our fiducial model), our 
algorithm recovers only a small fraction of even the richest 
clusters (2 — 10%). We expect the foreground contamina- 
tion to be less at lower frequencies, and even with the cor- 
responding decrease in angular resolution cluster finding is 
better done with 217 GHz and below if the signal-to-noise 
ratio in a given region of sky is as low as in our fidu- 
cial model. The combination of candidates from both the 
high and low frequency methods, perhaps used in different 
parts of the sky, would provide an interesting cross check 
on foreground contamination which we shall defer to fu- 
ture work. The uncertainties in the amplitude of the power 
spectrum, the signal and noise levels, the sky distribution 
of the foregrounds and the optimal methods for identi- 
fying clusters make predictions of the number of clusters 
that will be found by Planck highly uncertain. Nominally 
Planck should detect from several thousand to more than 
ten thousand rich clusters. Even very pessimistic assump- 
tions give the return at several hundred clusters, suggest- 
ing that the Planck catalogue will be a fertile ground for 
future investigations. 

Using simple minded scaling arguments we have esti- 
mated the X-ray, optical and weak lensing properties of 
the Planck cluster sample. We find that most of the sam- 
ple will be strong weak lensing sources, and will contain a 
large number of galaxies. The X-ray emission from many 
sources will be detectable with relatively deep integrations 
with existing facilities should they still be operational. 
However, given the high-z and all-sky nature of the ex- 
pected cluster yield, pre-launch samples will most likely 
be constructed with dedicated SZ instruments, optical or 
weak lensing surveys. 

An all-sky sample of massive clusters with a well under- 
stood selection function would be a goldmine for cosmol- 
ogy. Planck should be the first mission capable of produc- 
ing such a catalogue. However the 5' resolution of Planck, 
along with noise from astrophysical and detector sources, 
make understanding of the selection function highly non- 
trivial. Better algorithms for extracting clusters from the 
multi-frequency maps need to be developed and more real- 
istic simulations performed before precision cosmology can 
be extracted from future Planck SZE cluster observations. 
To aid in this endeavor the maps, cluster catalogues and 
derived quantities from this work are available to the pub- 
lic at http://mwhite.berkeley.edu/ It is to be hoped that 
these simulations will be replaced by deep, high resolution, 
wide field SZE survey data as it becomes available. 
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